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(54) Dynamic cache preloading across looselyHCoupled administrative domains 


(57) A caching server that provides faster access 
times for independently operating network elements. 
The caching server initiates information transfer and 
holds the requested information in its memory, instead 
of caching information transfer in response to user re- 
quests. The caching server prebads information from 
another server into its memory based on a set of prede- 
termined criteria. Such preloading preferably occurs 
during low usage time, such as nighttime. The informa- 
tion source determines which information is described 
in the catalog in accordance with one of a second set of 
predetermined criteria. For example, the catalog may be 
organized according to size and the information source 
will organize the catalog by file size. The caching server 
preloads as many often-used files as will fit in its cache 
memory 
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Description 

This application relates to networking and, specifi- - 
cally, to a method and apparatus for. caching in a net- 
work where the elements of the, network are operating/ s 
independently. . .. ~ 

Recent years h§ve seen a, huge rise in the popular- 
ity of network systems, such as .the internet. A network 
is formed of independently operating elements that are 
interconnected^ a way that allows them to transmit ; in- 10 
formation. For example, a first network element can 
transmit World Wide Web pages to. a second network 
element. The World Wide Web allows a userto use "web 
browser" software on his computerrsystem to view "web 
pages' stored on other systems in the network. Similar- is 
ly, network elements can transfer files to other network 
elements via File Transfer, Protocol (FTP). 

The amount of information transmitted over net- 
works such as the internet.has grown and will continue 
to grow in the future. While the overall amount of infor- 20 
mation being transferred has grown, the amount of in- 
formation being transferred during a day still varies at 
different times of the day. A time of day when the most 
information is being transferred is palled a "peak usa^e 
time." A. time of day ,when a relatively smallamount of 2s 
information is being transferred is called a "low usage 
time." For example, a network that allows a consumer 
to download information to a personal computer using 
a cable modem might have a peak usage time when 
people get home f rom.work and a low usage time in the 30 
middle of the night when- people are asleep. It is desir- 
able that usage of the network be ; evened out as much 
as possible.-.lt is also desirable that response time be 
increased; as much as possible. ». <- ? . : - . , , * .*. . - . ;! 

Moreover, in conventional networks, various tele- 35 
ments of the network have different goals. For example, 
in the above; mentioned consumer network, 3 centrally 
located network element may have a goal of sending as 
much information as possible out over the network. Jn 
contrast, a "local" - computer may. have a goal of giving 40 
"its" users the best response time possible. < <• 4 . - 

Local computers often "cache" incomingMnforma- 
tion. For example, a web browser may save in its mem- 
ory a copy of the ten most recent pages viewed by the 
user. That way, if the user .wants to view any^of the 45 
cached web pages, the browser software does not have 
to send a request for the web page out over the network: 
Instead, the browser can merely get the cachedjweb 
page from its memory. The browser can access its mem- 
ory much faster than it can request a web page and re- bo 
ceive it over the network. Thus, caching saves the time 
otherwise required to fetch the web page from over the 
network. Local computers often cache the most recently 
accessed information so that the information will .be 
available should the user request, it again. A problem ss 
associated with conventional caching is that the user 
must wait for information to be downloaded into the local 
computer when the user makes a first request for the 


information. (Caching is performed in response to a user 
requestor information). A second problem is that the 
information to be cached is often requested and down- 
loaded during : peak usage times. 

Particular and preferred aspects of the invention are, . 
set out in the accompanying independent and depend- - 
ent claims. Features of the dependent claims may be 
combined with those of the independent claims as ap- 
propriate and in combinations other than those explicitly 
set out in the claims. 

The present invention oyercpmes theproblems and 
disadvantages of the prior artiy implementing a each-.' 
ing server that provides faster access times for inde- 
pendently operating network elements. Such independ- 
ently operating network elements are said to have 
" loosely coupled admin ist rati ve domains. " I n a preferred 
embodiment of the present invention, the caching server 
initiates jnformation transfer and holds the requested in-, 
formation in its memory; instead of caching information 
transfer in response to user requests. ; 

In a preferred embodiment of the present invention/ 
the caching server preloads information from another 
server into its memory based on a set of predetermined 
criteria. For example, the caching server may download 
as many of the largest files as will fit in its cache memory. 
Such preloading preferably occurs during low usage 
time. Specifically, the caching server, requests a "cata- 
log" of available information from an information source, 
such as another server The information source sends 
the catalog over the network. The caching server then 
decides which information it wants to preload and re- 
quests that information from the information source. 
Once the information , is received, the caching server 
holds the information in a cache memory so that the in- 
formation will be available when requested by a user. 

Cache preloading has two main benefits: 1 ) it elim- 
inates the latency time associated with accessing infor- 
mation requested by a user for the first time and 2) the 
information; fetch between - the information source and 
the caching sejver can take place during off-peak hours, 
thereby, reducing the network load at peak times. 

In a preferred embodiment of the present invention, 
the information source determines which information is 
described in the catalog in accordance with one of a set 
of predetermined criteria. For example, the catalog may 
be, organized, according to, size and the information 
source will organize the catalog by file size. As a second 
example, the catalog may be organized according to the 
files most-otten downloaded by users. In this case, the 
caching server wijl preload as many often-used files as 
will fit in its cache memory. As, a third example, the cat- 
alog may be organized according to which files adver- 
tisers have paid to place in the catalog. In this case, the 
caching server will preload as many of these files as will 
fit in its cache memory. ■ , 

In accordance with an aspect of the invention, there 
is provided a method for quickly accessing information 
in a network that includes an origin system connected 
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to a caching server, the method comprising the steps, 
performed by the caching server of: requesting a catalog 
from the origin system that lists documents available 
from the origin system; receiving the catalog from the 
origin system; requesting a document described in the 
catalog, the document requested according*) ^"prede- 
termined criterion of the caching server; receiving the 
document from the origin system; and storing the doc- 
ument in a cache memory of the caching server. 

In accordance with another aspect of the invention, 
there is provided a caching server having a cache mem- 
ory, the caching server being part of a network that in- 
cludes an origin system connected to the caching serv- 
er, the caching server comprising: a first portion config- 
ured to request a catalog from the origin system that lists 
documents available from the origin system; a second 
portion configured to receive the catalog from the origin 
system; ^ third portion configured to request a document 
described in the catafog, the document requested ac- 
cording to a predetermined criterion of the caching serv- 
er; a fourth portion configured to receive the document 
from the origin system; and a fifth portion configured to 
store the document in the cache memory. 

Advantages of the invention will be set forth in part 
in the description which follows and in part will be obvi- 
ous from the description or may be learned by practice 
of the invention. The objects and advantages of the in- 
vention will be realized and attained by means of the 
elements' and combinations particularly pointed out * in 
the appended claims and equivalents. 

Exemplary ^embodiments of the invention are de- 
scribed hereinafter, by way of example only with refer- 
ence to the'accompanying drawings, in which: 

Figure 1 is a block diagram of a data processing 
system in accordance with a preferred embodiment of 
the present invention. 

Figure 2 is a diagram showingsteps^f a method 
for preloading a caching server. - ^ ; : * 

Figure 3 is a diagram showing steps of : a method 
for fulfilling a 'user's' request for information.-- " - 

Figures 4a- through 4c show examples of possible 
catalog formats: ' - ; v .- . 

Reference will now be made ; in detail to the pre- 
ferred embodiments of the invention, examples of which 
are illustrated in the accompanying drawings; Wherever 
possible, the same reference numbers will be used 
throughout the drawings to refer ft5 the Same *6r like 
parts.-" : * ■ \ » 

Figure 1 is a block diagram of a computer "network 
^100 in accordance with a preferred embodiment of the 
present invention. Corhputer network 1 00 includes an 
origin computer system 101 and a caching computer 
system 1 51 Origin system 101 andcaching system- 1 51 
are connected via connection 130. The computer sys- 
tem 100 can be any type of network, such as "the inter- 
net; a LAN, a WAN, an intranet, etc. , that allows systems 
101 and 151 to communicate with each other. Origin 
system 101 and caching system 151 operate independ- 


ently and communicate using any known protocol, such 
as Hyper Text Transfer Protocol (http). Although only 
two systems 101 and '151 "are shown, it should be un- 
derstood that additional systems can be included in 
5~- corhputer 1 network' 100 without departing from the the 
present invention. 

Origin computer system 101 includes^ processor 
102- and a memory 104. Memory 104 includes origin 
server software 1 1 2 and a catalog 1 1 4". Catalog 1 1 4 con- 
10 tains, for example; a description of information available 
from origin system 101.. System 101 preferably con- 
nects to a display device 132 and an input device 134, 
which can be any of a preferably connects to a display 
device 132 and to arn in puT device T34, "which can be 
is . any of a wide range of vah/in^l/O devices, such as disk * 
drives; keybdards, modems, network adapters/ printers, 
and displays. Respective computer readable devices -■ 
136, such as a disk drive or*CD ROM drive, are shown 
connected to origin system 101 and caching system 
26 151 ; Origin server software 112 and Caching server: 
software 162 are preferably loaded into memory via de- 
vices 136. * ' - 'L tj^. 

Caching computer system 151 includes a processor 
T52 : and'a*memory 154. Memory 154 includes caching ' 
25 server software 1 62 and a catalog .1 64 transmitted from 
origin system-1 Ot . Figure 1 also shows a cache memory 
174. Gache^emory '174 may be a part of memory 104* 
(as shown) or may be a separate memory: Cache mem- 
dry 174 has a predetermined maximum size^ln the de- 
30 scribed embodiment, caching system 151 is connected 
to client#1 160 and dient#2 192.rln the described em- * 
, bodiment, client#1 1 90^hd caching 1 system 151 jointly 
form *S caching -server T94-^ In other embodirhents, \ c\\< 
ent#1 may be located in either-the same or^ardifferent 
35 computers caching systems 51 . In still other preferred 
emb<5dimertts, cachirig system 151 perforrrWthe func- 
tions of client#1 190.<Client#2 may be located in either 
. the s^rrie** a different computer as caching system 
151:' ^ e - * s 'c \.o tr.. : . - . n ■ .■ 
40 r. Preloading of caching system 151 is allowed from 
origin system lOI^-where the priorities of the entities ad- 
ministrating the systems 101 and are 151 different. For 
example, the origin system 101 will want to "push out" 
as much information as possible; while the caching sys- 
"45 tern 151 is willing only to cache a certain amount of in- 
formation. Origin system 101 may be, for example,. a 
large, ; centrally located-server that stores a large number 
of httfr documents, such as web pages and files. Gach-. 
ing system 151 may be, for example, ^regional server 
50 accessible by clients in a geographical region. Atypical 
network may include multiple origin systems 101 and/or 
multiple caching systems 151. : 

Figure 2 is a diagram showing steps of a method 
for preloading caching system 151 in accordance with 
55 a preferred embodiment of the present invention. The 
figure is divided into three areas, representing, respec- 
tively, Client#1 1 90, caching system 151 and origin sys- 
tem 101. in the described embodiment, the client and 
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the servers communicate with one another via the bttp - 
protocol, although other suitable protocols could be 
used. Processor 152 executes caching server, software 
162 to perform the steps of the middle column. Proces- 
sor 102 executes origin server software 112 to perform 
the steps of the right-most column. Ciient#1 i 90 and cli- 
ent#2 192 preferably contain a processor executing 
software performing the illustrated steps. As discussed 
above, Figure 1> shows client#1 1 90 as a separate proc- 
ess, which may or may not run on a separate machine. 
In an alternate embodiment, the functions of client#1 
190 are performed by caching server software 162. 

In step 202, client* 1 190 initiates a request for a 
catalog to caching system 1 5.1 . In the described embod- 
iment, client#1 190 sends such a request at a predeter- 
mined time, such as in the middle of the night. It is de- 
sirable that the request of step. 202 be performed at a 
time when the network is not busy. Such timing helps to 
reduce network load at peak times. In step 204, caching 
system 151 sends a request for a catalog to origin sys- 
tem 101. In step 206, origin system 101 sends the cat- 
alog 114 to caching system 151. In step 208, caching 
system 151 sends the catalog to client#1 190. 

Steps 210 .through 220 are performed multiple 
times until client#1 190 determines that cache memory 
174 is full (or that^some maximum amount of cache 
space is filled). Alternately, client#1 1 90 may send a list 
of information to preload to caching system 1$1 and 
caching system 151 will load as many of the jtems on 
the list as will fit in cache memory 174. In step 210, cli- 
ents 1 90 requests an item in the catalog from caching 
system 151.- Caching system. 151 sends the request to 
origin system 101 . which returns the requested item. 
Caching system 1 51 caches the returned item in cache 
memory 174. Figure 2 shows , that the, item is then for- 
warded to client#1 1 90, which discards it, but may keep 
information about the item, such as the size. Alternately, 
caching system 151 may simply cache the item without 
sending it tOrClient#1 1 90. Thus, at the end of step 21 6, 
information has; been preloaded into caching system 
151 and is awaiting user requests for the information. 

Figure 2 also shows a step 230 in .whicl\client#1 
190 sends special instructions to, caching system 151. 
These special instructions may, for example, . specify 
that certain information in cache memory 174 should not 
be discarded from cache memory for a predetermined 
amountof time. As another example, the special instruc- 
tion may instruct caching system 1 51 to immediately dis- 
card certain; information in cache memory 1 74. The in- 
formation may be specified, for example, by its URL 
(Uniform Resource Locator). Alternately, the special in- 
struction may instruct the caching system 15Vto delete 
(or. save) information in. a last-in, first-out order or in 
some other appropriate order. 

Figure 3 is a diagram showing steps of a method . 
for fulfilling a user's request for information. The steps 
of Figure 3 are performed after the caching system 151 
has preloaded some information into cache memory 


174, as shown in Figure 2. In the example, the user is 
logged" onto client#2 192. In step 302, the User requests 
information such as a file or a web page by URL. If, in 
step 304, the requested information has-been preloaded 
s and exists in cache memory 1 74, the information is cop- 
ied from cache memory 1 7 A and sent to client#2 1 92. If, 
in step 308, the information was not preloaded, then, in 
step 310, caching system 151 requests the information 
from origin system 101 , which returns the requested in- 
fo formation. In step 314, caching system 151 stores the 
information in cache memory 174 and, in step 316, 
sends the information to the requesting client. Caching 
system 151 determines whether certain information is' 
present in cache memory 174 via any of a variety of 
is known cache memory techniques. 

Note that step 31 4 may involve purging (discarding) 
items from cache 174 to make room" for new items. If 
caching system 151 has received special " instructions 
not to. discard certain cached information, then in step 
20 31 4, that information will not be discarded from cache, 
even if it would normally be next in line to be discarded. 

Figures 4a through 4c show respective examples 
of formats of catalog 11 4. Each example format includes 
a section, or row in the catalog for each file or URL that 
25 is available to be downloaded Irom the origin system. 
These formats are provided by way of example only and 
are not meant to limit the scope of the present invention. 
Catalog 11 4 is preferably an http document of type text/ 
plain." Figure 4a shows a format in which catalog 114 
30 includes, two fields: a Uniform Resource "Locator (URL) 
field 402 and a size field 404. The catalog may optionally 
be organized by increasing or decreasing size. 

Figure 4b shows a format in which catalog 114 in- 
cludes three fields: a URL field 410, a size field 41 2, and 
3S a "# of times downloaded" field 41 4 (also called a "pop- 
ularity" field). Alternately, the popularity field 414 could 
be omitted and the catalog could simply be sorted by 
ascending or .descending "popularity value. 

Figure, 4c shows a format in which the catalog in- 
40 cludesthree fields; a URLfield 420, a size field 412, and 
a "desirability field." Alternately, the desirability field 412 
could be omitted and the catalog could simply be sorted 
by ascending or descending desirability. In this exam- 
ple, desirability indicates a criteria defined in origin sys- 
f$ tern 101. For ^example : if an advertiser had paid a pre- 
mium to have his information placed into the catalog, 
the information would have a high desirability value in 
. the catalog. Qther examples of catalogs might have the 
URLs sorted according to their PICS rating, or any sim- 
. so j|ar rating which rates the content of files. ^ 

Caching system p 1 51 reviews the contents of cata- 
log 114 and determines which available information de- 
scribed in catalog 114.it should preload. This determi- 
nation is made in accordance with a number of criteria 
, ss stored in the memory of caching system 1 51 . A simple 
criteria, for example, is to simply preload files in the cat- 
alog by size, largest first, until cache memory 174 is full 
(or a predetermined cache size limit has been reached). 
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Another.possible criteria might be that caching system 
1 51 preloads according to both the size of the file, the 
available cache space, and the 'popularity" or 'desira- 
bility' of the file. Another possible criteria might be that 
the caching system 151 preloads in accordance with 
one of the criteria described above, but does not under 
any circumstances, download information on a "do not 
download" list stored in its memory. For example, certain 
caching systems 151 might never want to download 
adult material or material from a predetermined informa- 
tion source. As can be seen "from the above examples, 
the criteria used to determine what information to down- 
load is affected somewhat by what infbrmation is avail- 
able in catalog 114. A large number of catalog formats 
and caching system criteria are compatible with embod- 
iments of the present invention. t , / ; 

. In "some embodiments, origin system 101 gives 
■hints' to the caching system/client as to when it should 
next request a catalog. Theses hints rhay be sent as part 
of the catalog or may 'be sent in a separate administra- 
tive message. For example, If the catalog is normally 
updated' every twenty^our hpurs, origin system 101 
might suggest that the catalog be downloaded by the 
caching system every twenty-four hours. If, however, 
some important event is occurririg/origin systerri 101 
rriight suggest that the catalog be downloaded more fre- 
quently because it is being updated more frequently by 
origin system 101. " \ 

- In a preferred embodiment; if origin system 101 is 
accessed by multiple caching servers, a different cata- 
log can be sent to each caching seiryer 1 94:The catalog 
is dynamically generated based on, for example, the ac- 
cess pattern generated by the particular caching server 
"as observed by the origin system. Alternately/ catalogs 
are determined by policy decisions at the origin system 
Tl 01 " For example, the catalogs sent to a certain caching 
server may not contain adult material. 

In summary, an embodiment of the invention allows 
a caching server, preferably iriclucling a* client and a 
caching system, tojrirtiate a request for a catalog of 
available http documents. This requestis made at night 
or during a slow time on trie network. After the catalog 
is received, the caching server decides which of the doc- 
uments in .the catalog to preload into a'cache memory. 
^Thus, the preloaded documents will be available in f the 
cache memory of the caching server when a user re- 
quests them. The caching server can decide that'some 
or all preloaded documents should not be purged from 
its cache for a predetermined time. \ 

Other embodiments will be apparent to those skilled 
in the art from consideration of the specification and 
practice of the invention disclosed herein. It is intended 
that the specification and examples' be considered as 
t exemplary only. 


Claims 

1. V A method for accessing information in a network 
' that includes an origin system connected to a cach- 
5 irig server, the* rrtethod comprising the steps, per- 
formed by the caching server of: 


• requesting a catalog from the origin system that 
lists documents available from the origin sys- 
io ' tern; ' 

*\ receiving the catalog from the origin system; 
1 ' requesting a document described in the cata- 
log, the document requested according to a 
•predetermined criterion of the caching server; 
1 s 1 receiving the document from the origin system; 

' and 

storing the document in a cache memory of the 
caching server. * ■ ' • r ' ' 

20- 2. The' method of claim 1 , further including the steps - 
'of: - ' - ; : 


receiving, by the caching server, a request for 
the document-f rom a user; and 
sending, by the 'caching server, the document 
stored in the cache memory teethe user. / 


2S 


30 


35 


40 


3. The method of claim 1 , wherein the step of request- 
ing a catalog from the origin system performed dur- 
ing a low usage tirhe of the network. - - 

4. The'method of claim 1 , wherein the cachrrig server 
' includes a client and a caching system and wherein 

1 the v step of requesting a catalog from the origirtfsys- 
tern includes the steps' of : - i => 

sendingthe request forthe catalog from the.cli- 
1 ent to the caching system; and 
- sending the request for the catalog from the 
' caching system to the origin system. 


5. The method of claim 1 , wherein the caching server 
includes a client and a caching system and wherein 

' the step of requesting a document described in the 
'45 : catalog includes the steps of: 

sending the request for the catalog from the cli- 
v * ent to the baching system; and 

sending the request for the catalog from the 
so caching system to the origin system. 

6. '-The method of claim 1 , wherein the criterion used 

by the caching server to determine- whether, to re- 
quest a document from the catalog is based on the 
55 r cv'size of the document. 

7. The method of claim 1 , wherein the criterion used 
' * by the caching server to determine whether to re- 
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quest a document from the catalog is based on a 
rating associated with the document and stored in 
the catalog. 

8. The method of claim 1 , wherein the criterion used s 
by the caching server to determine whether to re- 
quest a document from the catalog is based on a 

list of non-acceptable document names. 

9. The method of claim 1 , wherein the network in- 10 
eludes a second caching server and the first and 
second caching servers receive different catalogs 
from the origin system. 


15 


20 


25 


10. The method of claim 1, wherein the network in^ 
eludes a second caching server and the first and 
the second caching servers request different docu- 
ments from the origin system. 

11. The method of claim 1, wherein the origin system 
and the caching server use http protocol to commu- 
nicate. 

12. The method of claim 1, wherein the caching server 
includes a client and a caching system and further 
including the steps of: 

sending, by the client to the caching system, an 
instruction to retain the document in the cache 
memory for a specified amount of time; and 
retaining the document in the cache memory, 
by the caching system, for the specified amount 
of time. " 


13. The method of claim 1, wherein the caching server 35 
includes a client and a caching system and further 
including the steps of: 


30 


■ sending, by the client to the caching system, an 
instruction to purge the document .from the 
cache memory; and ' - - 

purging the document from the cache memory, 
by the caching system. T 

1 4. A caching server having a cache memory, the cach- 
ing server being part of a network thatjncludes an 
origin system connected to the caching server, the 
caching server comprising: * 

a first portion configured to request a catalog 
from the origin system that lists documents 
available from the origin system; 
a second portion configured to receive the cat- < 
alog from the origin system; 
a third portion configured to request a docu- 
ment described in the catalog, the document re- 
quested according to a predetermined criterion 
of the caching server; 


40 


a fourth portion configured to receive the doc- 
ument from the origin system; and 
a fifth portion configured to store the document 
in the cache memory. 

15. The caching server of claim 1 4, wherein the caching 
server includes a client and a caching system and 

. _ wherein the first portion includes: 

• - a sixth portion configured to send the request 

for the catalog from the client to the caching 
— ~ ■ system; and . . - / ; 

a seventh portion configured to send the re- 
quest for the catalog from the caching system 
to the origin system. 

16. A computer program product comprising: 

a computer usable medium having computer 
readable code embodied therein for causing 
access to information stored on a network hav- 
ing a origin system and a caching server, the 
computer program product comprising: 
computer readable program code devices con- 
figured to cause a computer to effect requesting 
a catalog from the origin system that lists doc- 
uments available from the origin system; 
computer readable program code devices con- 
figured to cause a computer to effect receiving 
' the catalog from the origin system; 
computer readable program code devices con- 
figured to cause a computer to effect requesting 
a document described in the catalog, the doc- 
ument requested according to a predetermined 
criterion; 

computer readable program code devices con- 
figured to cause a computer to effect receiving 
the document from the origin system; and 

• computer readable program code devices con- 
figured to cause a compute r to effect storing the 
document in the cache memory. 
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